16 research outputs found

    Identification of differentially expressed subnetworks based on multivariate ANOVA

    Background: Since high-throughput protein-protein interaction (PPI) data have recently become available for humans, there has been growing interest in combining PPI data with other genome-wide data. In particular, the identification of phenotype-related PPI subnetworks using gene expression data has attracted considerable attention. Successful integration for the identification of significant subnetworks requires a search algorithm with a proper scoring method. Here we propose a multivariate analysis of variance (MANOVA)-based scoring method with a greedy search for identifying differentially expressed PPI subnetworks.
    Results: Given the MANOVA-based scoring method, we performed a greedy search to identify the subnetworks with the maximum scores in the PPI network. Our approach was successfully applied to human microarray datasets. Each identified subnetwork was annotated with Gene Ontology (GO) terms, yielding a phenotype-related functional pathway or complex. We also compared these results with those of other scoring methods, such as t-statistic-based and mutual information-based scoring. The MANOVA-based method produced subnetworks with a larger number of proteins than the other methods. Furthermore, the subnetworks identified by the MANOVA-based method tended to consist of highly correlated proteins.
    Conclusion: This article proposes a MANOVA-based scoring method that combines PPI data with expression data using a greedy search. The method is recommended for the highly sensitive detection of large subnetworks.
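
The greedy search described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the MANOVA-based score is replaced here by a simple Welch t statistic on the subnetwork's averaged expression (a t-statistic-based score is one of the comparison methods the abstract mentions), and the names `t_score` and `greedy_subnetwork`, the toy network, and the expression values are all hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def t_score(genes, expr, labels):
    """Absolute Welch t statistic of the per-sample mean expression over `genes`.

    Stand-in score (assumption): the abstract's method uses a MANOVA-based score.
    """
    activity = [mean(expr[g][i] for g in genes) for i in range(len(labels))]
    a = [x for x, lab in zip(activity, labels) if lab == 0]
    b = [x for x, lab in zip(activity, labels) if lab == 1]
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return abs(mean(a) - mean(b)) / se if se > 0 else 0.0


def greedy_subnetwork(seed, network, expr, labels, score=t_score, max_size=10):
    """Grow a connected subnetwork from `seed`, one neighbor at a time."""
    members = {seed}
    best = score(members, expr, labels)
    while len(members) < max_size:
        # Candidate nodes: neighbors of the current subnetwork not yet included.
        frontier = {n for m in members for n in network.get(m, ())} - members
        candidates = [(score(members | {n}, expr, labels), n) for n in sorted(frontier)]
        if not candidates:
            break
        cand_score, cand = max(candidates)
        if cand_score <= best:
            break  # no neighbor improves the score, so stop growing
        members.add(cand)
        best = cand_score
    return members, best


# Hypothetical toy data: an undirected PPI network and six samples (three per phenotype).
network = {"A": ["B", "D"], "B": ["A", "C"], "C": ["B"], "D": ["A"]}
labels = [0, 0, 0, 1, 1, 1]
expr = {
    "A": [1.0, 1.2, 0.9, 2.0, 2.3, 1.9],
    "B": [0.8, 1.1, 1.0, 2.1, 2.2, 2.0],
    "C": [1.0, 1.1, 0.9, 2.0, 2.1, 1.9],
    "D": [2.0, 1.0, 1.5, 1.1, 1.9, 1.4],
}
members, best = greedy_subnetwork("A", network, expr, labels)
print(sorted(members), round(best, 2))
```

Because `score` is a parameter, any other scoring function with the same signature (for example a MANOVA-based one) could be passed in without changing the search loop.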

    High Accordance in Prognosis Prediction of Colorectal Cancer across Independent Datasets by Multi-Gene Module Expression Profiles

    A considerable portion of patients with colorectal cancer have a high risk of disease recurrence after surgery. These patients can be identified by analyzing the expression profiles of signature genes in tumors. However, there is no consensus on which genes should be used, and the performance of a specific set of signature genes varies greatly across datasets, impeding routine clinical use. Instead of using individual genes, we identified functional multi-gene modules with significant expression changes between recurrent and recurrence-free tumors, and used them as signatures for predicting colorectal cancer recurrence in multiple datasets that were collected independently and profiled on different microarray platforms. The multi-gene modules we identified are significantly enriched for known genes and biological processes relevant to cancer development, including genes from the chemokine pathway. Most strikingly, they are also significantly enriched for somatic mutations found in colorectal cancer. These results confirm the functional relevance of these modules for colorectal cancer development. Further, the functional modules from different datasets overlapped significantly. Finally, we demonstrated that, by leveraging the above information about these modules, our module-based classifier avoided arbitrarily fitting the classifier function and screening the signatures on the training data, and achieved more consistent prognosis prediction across three independent datasets, even with very small training sets of tumors.

    An integrative approach to identifying cancer chemoresistance-associated pathways

    Background: Resistance to chemotherapy severely limits the effectiveness of chemotherapy drugs in treating cancer. Still, the mechanisms and critical pathways that contribute to chemotherapy resistance are relatively unknown. This study elucidates the chemoresistance-associated pathways retrieved from integrated biological interaction networks and identifies signature genes relevant to chemotherapy resistance.
    Methods: An integrated network was constructed by collecting multiple metabolic interactions from public databases, and the k-shortest path algorithm was implemented to identify chemoresistance-related pathways. The identified pathways were then scored using differential expression values from microarray data in chemosensitive and chemoresistant ovarian and lung cancers. Finally, another pathway database, Reactome, was used to evaluate the significance of genes within each filtered pathway based on topological characteristics.
    Results: With this method, we discovered pathways specific to chemoresistance. Many of these pathways were consistent with or supported by known involvement in chemotherapy. Experimental results also indicated that integrating pathway structure information with gene differential expression analysis can identify dissimilar modes of gene reactions between chemosensitivity and chemoresistance. Several identified pathways may promote the development of chemotherapeutic resistance, and the predicted signature genes are involved in drug resistance during chemotherapy. In particular, we observed that some genes were key factors for joining two or more metabolic pathways and passing down signals, and these may be potential key targets for treatment.
    Conclusions: This study is expected to help identify targets for addressing chemoresistance and highlights the interconnectivity of chemoresistance mechanisms. The experimental results not only offer insights into the mode of biological action of drug resistance but also provide information on potential key targets (new biological hypotheses) for further drug-development efforts.

    Geometric Interpretation of Gene Coexpression Network Analysis

    The merging of network theory and microarray data analysis techniques has spawned a new field: gene coexpression network analysis. While network methods are increasingly used in biology, the network vocabulary of computational biologists tends to be far more limited than that of, say, social network theorists. Here we review and propose several potentially useful network concepts. We take advantage of the relationship between network theory and the field of microarray data analysis to clarify the meaning of and the relationship among network concepts in gene coexpression networks. Network theory offers a wealth of intuitive concepts for describing the pairwise relationships among genes, which are depicted in cluster trees and heat maps. Conversely, microarray data analysis techniques (singular value decomposition, tests of differential expression) can also be used to address difficult problems in network theory. We describe conditions when a close relationship exists between network analysis and microarray data analysis techniques, and provide a rough dictionary for translating between the two fields. Using the angular interpretation of correlations, we provide a geometric interpretation of network theoretic concepts and derive unexpected relationships among them. We use the singular value decomposition of module expression data to characterize approximately factorizable gene coexpression networks, i.e., adjacency matrices that factor into node specific contributions. High and low level views of coexpression networks allow us to study the relationships among modules and among module genes, respectively. We characterize coexpression networks where hub genes are significant with respect to a microarray sample trait and show that the network concept of intramodular connectivity can be interpreted as a fuzzy measure of module membership. We illustrate our results using human, mouse, and yeast microarray gene expression data. 
The unification of coexpression network methods with traditional data mining methods can inform the application and development of systems biologic methods.
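
The angular interpretation of correlations mentioned in this abstract rests on a standard fact: the Pearson correlation of two expression profiles equals the cosine of the angle between their mean-centered vectors. The sketch below checks this numerically. The function names and the two toy profiles are hypothetical and are not taken from the paper.

```python
from math import acos, degrees, isclose, sqrt


def pearson(x, y):
    """Pearson correlation computed from covariance and variances."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def cosine_of_centered(x, y):
    """Cosine of the angle between the mean-centered versions of x and y."""
    cx = [a - sum(x) / len(x) for a in x]
    cy = [b - sum(y) / len(y) for b in y]
    dot = sum(a * b for a, b in zip(cx, cy))
    norm_x = sqrt(sum(a * a for a in cx))
    norm_y = sqrt(sum(b * b for b in cy))
    return dot / (norm_x * norm_y)


def angle_degrees(x, y):
    """Angle between two centered profiles; the clamp guards against rounding."""
    c = max(-1.0, min(1.0, cosine_of_centered(x, y)))
    return degrees(acos(c))


# Hypothetical expression profiles of two genes across five samples.
gene1 = [2.1, 3.4, 1.9, 4.2, 3.0]
gene2 = [1.8, 3.1, 2.2, 3.9, 2.7]
print(pearson(gene1, gene2), cosine_of_centered(gene1, gene2))
print(angle_degrees(gene1, gene2))
```

Under this view, strongly coexpressed genes point in nearly the same direction (small angle), while uncorrelated genes are close to orthogonal.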

    Heat shock partially dissociates the overlapping modules of the yeast protein-protein interaction network: a systems level model of adaptation

    Network analysis has become a powerful tool in recent years. Heat shock is a well-characterized model of cellular dynamics. S. cerevisiae is an appropriate model organism, since both its protein-protein interaction network (interactome) and its stress response at the gene expression level have been well characterized. However, the reorganization of the yeast interactome during stress has not yet been investigated. We calculated the changes in the interaction weights of the yeast interactome from the changes in mRNA expression levels upon heat shock. The major finding of our study is that heat shock induced a significant decrease in both the overlaps and the connections of yeast interactome modules. In agreement with this, the weighted diameter of the yeast interactome showed a 4.9-fold increase in heat shock. Several key proteins of the heat shock response became centers of heat shock-induced local communities, as well as bridges providing a residual connection between modules after heat shock. The observed changes resemble a "stratus-cumulus" type transition of the interactome structure, since the unstressed yeast interactome had a globally connected organization, similar to that of stratus clouds, whereas the heat-shocked interactome had a multifocal organization, similar to that of cumulus clouds. Our results show that heat shock induces a partial disintegration of the global organization of the yeast interactome. This change may be rather general, occurring in many types of stress. Moreover, other complex systems, such as single proteins, social networks and ecosystems, may also decrease their inter-modular links, thus developing more compact modules, and display a partial disintegration of their global structure in the initial phase of a crisis. Thus, our work may provide a model of a general, system-level adaptation mechanism to environmental changes.
    Comment: 24 pages, 6 figures, 2 tables, 70 references + 22 pages, 8 figures, 4 tables and 8 references in the enclosed Supplement

    A Scalable Approach for Discovering Conserved Active Subnetworks across Species

    Overlaying differential changes in gene expression on protein interaction networks has proven to be a useful approach to interpreting the cell's dynamic response to a changing environment. Despite successes in finding active subnetworks in the context of a single species, the idea of overlaying lists of differentially expressed genes on networks has not yet been extended to support the analysis of multiple species' interaction networks. To address this problem, we designed a scalable, cross-species network search algorithm, neXus (Network - cross(X)-species - Search), that discovers conserved, active subnetworks based on parallel differential expression studies in multiple species. Our approach leverages functional linkage networks, which provide more comprehensive coverage of functional relationships than physical interaction networks by combining heterogeneous types of genomic data. We applied our cross-species approach to identify conserved modules that are differentially active in stem cells relative to differentiated cells based on parallel gene expression studies and functional linkage networks from mouse and human. We find hundreds of conserved active subnetworks enriched for stem cell-associated functions such as cell cycle, DNA repair, and chromatin modification processes. Using a variation of this approach, we also find a number of species-specific networks, which likely reflect mechanisms of stem cell function that have diverged between mouse and human. We assess the statistical significance of the subnetworks by comparing them with subnetworks discovered on random permutations of the differential expression data. We also describe several case examples that illustrate the utility of comparative analysis of active subnetworks
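
The permutation-based significance assessment mentioned at the end of this abstract can be sketched as an empirical p-value. This is a simplified illustration under stated assumptions, not the neXus procedure: it keeps one subnetwork fixed and shuffles the differential expression values across genes, whereas the abstract describes re-running the subnetwork discovery on each permuted dataset. The function name, the scoring rule (mean absolute differential expression), and the toy data are hypothetical.

```python
import random


def permutation_p_value(subnetwork, diff_expr, n_perm=1000, seed=0):
    """Empirical p-value of a subnetwork's mean absolute differential expression.

    Simplification (assumption): the subnetwork is held fixed and only the
    gene-to-value assignment is permuted.
    """
    rng = random.Random(seed)
    genes = list(diff_expr)
    values = [diff_expr[g] for g in genes]
    observed = sum(abs(diff_expr[g]) for g in subnetwork) / len(subnetwork)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(values)  # reassign differential expression values at random
        permuted = dict(zip(genes, values))
        null = sum(abs(permuted[g]) for g in subnetwork) / len(subnetwork)
        if null >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)


# Hypothetical data: 20 genes, three of which are strongly differentially expressed.
diff_expr = {f"g{i}": 0.1 for i in range(20)}
diff_expr.update({"g0": 3.0, "g1": 3.0, "g2": 3.0})
p = permutation_p_value(["g0", "g1", "g2"], diff_expr)
print(p)
```

A fixed `seed` makes the estimate reproducible. More permutations give a finer resolution for small p-values.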

    Protein–protein interaction networks suggest different targets have different propensities for triggering drug resistance

    Emergence of drug resistance is a major problem in the treatment of many diseases, including tuberculosis. To tackle the problem from a holistic perspective, it is essential to understand the molecular mechanisms by which bacteria acquire drug resistance using a systems approach. The availability of genome-scale data on expression profiles under different drug-exposed conditions and on protein–protein interactions makes it feasible to reconstruct and analyze systems-level models. A number of proteins involved in different resistance mechanisms, referred to as the resistome, were identified from the literature. The direct interaction of the drug with the resistome cannot adequately explain most resistance processes, including that of increased mutations in the target's binding site. We recently hypothesized that some communication might exist from the drug environment to the resistome to trigger the emergence of drug resistance. We report here a network-based approach to identify the most plausible paths of such communication in Mycobacterium tuberculosis. Networks capturing both structural and functional linkages among various proteins were weighted based on gene expression profiles upon exposure to specific drugs and on the betweenness centrality of the interactions. Our analysis suggests that different drug targets, and hence different drugs, could trigger the resistome to different extents and through different routes. The identified paths correlate well with the mechanisms known from experiment. Some examples of the top-ranked hubs in multiple drug-specific networks are PolA, FadD1, CydA, a monooxygenase and GltS, which could serve as co-targets that could be inhibited in order to retard resistance-related communication in the cell.